Across language families: Genome diversity mirrors linguistic variation within Europe

نویسندگان

  • Giuseppe Longobardi
  • Silvia Ghirotto
  • Cristina Guardiano
  • Francesca Tassi
  • Andrea Benazzo
  • Andrea Ceolin
  • Guido Barbujani
چکیده

OBJECTIVES The notion that patterns of linguistic and biological variation may cast light on each other and on population histories dates back to Darwin's times; yet, turning this intuition into a proper research program has met with serious methodological difficulties, especially affecting language comparisons. This article takes advantage of two new tools of comparative linguistics: a refined list of Indo-European cognate words, and a novel method of language comparison estimating linguistic diversity from a universal inventory of grammatical polymorphisms, and hence enabling comparison even across different families. We corroborated the method and used it to compare patterns of linguistic and genomic variation in Europe. MATERIALS AND METHODS Two sets of linguistic distances, lexical and syntactic, were inferred from these data and compared with measures of geographic and genomic distance through a series of matrix correlation tests. Linguistic and genomic trees were also estimated and compared. A method (Treemix) was used to infer migration episodes after the main population splits. RESULTS We observed significant correlations between genomic and linguistic diversity, the latter inferred from data on both Indo-European and non-Indo-European languages. Contrary to previous observations, on the European scale, language proved a better predictor of genomic differences than geography. Inferred episodes of genetic admixture following the main population splits found convincing correlates also in the linguistic realm. DISCUSSION These results pave the ground for previously unfeasible cross-disciplinary analyses at the worldwide scale, encompassing populations of distant language families.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Global distribution of genomic diversity underscores rich complex history of continental human populations.

Characterizing patterns of genetic variation within and among human populations is important for understanding human evolutionary history and for careful design of medical genetic studies. Here, we analyze patterns of variation across 443,434 single nucleotide polymorphisms (SNPs) genotyped in 3845 individuals from four continental regions. This unique resource allows us to illuminate patterns ...

متن کامل

Classification of the European language families by genetic distance.

Genetic distances among speakers of the European language families were computed by using gene-frequency data for human blood group antigens, enzymes, and proteins of 26 genetic systems. Each system was represented by a different subset of 3369 localities across Europe. By subjecting the matrix of distances to numerical taxonomic procedures, we obtained a grouping of the language families of Eu...

متن کامل

Language diversity and implications for Language technology in the Multilingual Europe

Europe has a particular and unique setting. On one had it has a great language diversity, there are twenty four official languages and a dozen of minority languages largely used. On the other hand most of these languages belong to one of the indo-European language families (Roman, Germanic Slavic) and within these language families similarities at lexical and syntactic level can be observed. Wh...

متن کامل

Language Diversity across the Consonant Inventories: A Study in the Framework of Complex Networks

In this paper, we attempt to explain the emergence of the linguistic diversity that exists across the consonant inventories of some of the major language families of the world through a complex network based growth model. There is only a single parameter for this model that is meant to introduce a small amount of randomness in the otherwise preferential attachment based growth process. The expe...

متن کامل

Genes and languages in Europe: an analysis of mitochondrial lineages.

When mitochondrial DNA sequence variation is analyzed from a sample of 637 individuals in 14 European populations, most populations show little differentiation with respect to each other. However, the Saami distinguish themselves by a comparatively large amount of sequence difference when compared with the other populations, by a different distribution of sequence diversity within the populatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 157  شماره 

صفحات  -

تاریخ انتشار 2015